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Accelerated TCP (Transport Control Protocol) Stack Processing 
FIELD 

[0001] Embodiments of this invention relate to accelerated TCP (Transport 
Control Protocol) stack processing. 

BACKGROUND 

5 [0002] Networking has become an integral part of computer systems. 
Advances in network bandwidths, however, have not been fully utilized due to 
overhead that may be associated with processing protocol stacks. Overhead may 
result from bottlenecks in the computer system from using the core processing 
module of a host processor to perform slow memory access functions such as 

10 data movement, as well as host processor stalls related to data accesses missing 
the host processor caches. A protocol stack refers to a set of procedures and 
programs that may be executed to handle packets sent over a network, where the 
packets may conform to a specified protocol. For example, TCP/IP (Transport 
Control Protocol/Internet Protocol) packets may be processed using a TCP/IP 

15 stack. 

[0003] One approach to reducing overhead is to partition protocol stack 
processing. For example, TCP/IP stack processing may be offloaded onto a 
TCP/IP offload engine (hereinafter "TOE"). In TOE, the entire TCP/IP stack may 
be offloaded onto a networking component, such as a MAC (media access 
20 control) component, of an I/O subsystem, such as a NIC (network interface card). 
However, a TOE may not be scalable to support a large number of connections 
due to the memory requirements associated with storing contexts associated with 
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these connections. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0004] Embodiments of the present invention are illustrated by way of 
example, and not by way of limitation, in the figures of the accompanying 
5 drawings and in which like reference numerals refer to similar elements and in 
which: 

[0005] FIG. 1 illustrates a network. 

[0006] FIG. 2 illustrates a system according to one embodiment. 

[0007] FIG. 3 is a flowchart illustrating a method according to one 
10 embodiment. 

[0008] FIG. 4 is a flowchart illustrating a method according to another 
embodiment. 

DETAILED DESCRIPTION 

[0009] Examples described below are for illustrative purposes only, and are 
15 in no way intended to limit embodiments of the invention. Thus, where examples 
may be described in detail, or where a list of examples may be provided, it should 
be understood that the examples are not to be construed as exhaustive, and do 
not limit embodiments of the invention to the examples described and/or 
illustrated. 

20 [0010] FIG. 1 illustrates a network 100 in which embodiments of the 

invention may operate. Network 100 may comprise a plurality of nodes 102A, ... 

2 
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102N, where each of nodes 102A, ... 102N may be communicatively coupled 
together via a communication medium 104. As used herein, components that are 
"communicatively coupled" means that the components may be capable of 
communicating with each other via wirelined (e.g., copper wires), or wireless (e.g., 
5 radio frequency) means. Nodes 102A . . . 102N may transmit and receive sets of 
one or more signals via medium 104 that may encode one or more packets. 

[001 1] As used herein, a "packet" means a sequence of one or more 
symbols and/or values that may be encoded by one or more signals transmitted 
from at least one sender to at least one receiver. As used herein, a 

10 "communication medium" means a physical entity through which electromagnetic 
radiation may be transmitted and/or received. Communication medium 104 may 
comprise, for example, one or more optical and/or electrical cables, although 
many alternatives are possible. For example, communication medium 104 may 
comprise air and/or vacuum, through which nodes 102A . . . 102N may wirelessly 

15 transmit and/or receive sets of one or more signals. 

[0012] In network 100, one or more of the nodes 102A . . . 102N may 
comprise one or more intermediate stations, such as, for example, one or more 
hubs, switches, and/or routers; additionally or alternatively, one or more of the 
nodes 102A . . . 102N may comprise one or more end stations. Also additionally 
20 or alternatively, network 100 may comprise one or more not shown intermediate 
stations, and medium 104 may communicatively couple together at least some of 
the nodes 102A . . . 102N and one or more of these intermediate stations. Of 
course, many alternatives are possible. 

[0013] At least one of nodes 102A, 102N may comprise system 200, as 

3 
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illustrated in FIG. 2. System 200 may comprise host processor 202, host memory 
204, bus 206, and chipset 208. (System 200 may comprise more than one host 
processor 202, host memory 204, bus 206, and chipset 208, or other types of 
processors, memories, and busses; however, the former are illustrated for 
5 simplicity of discussion, and are not intended to limit embodiments of the 

invention.) Host processor 202, host memory 204, bus 206, and chipset 208 may 
be comprised in a single circuit board, such as, for example, a system 
motherboard 218. 

[0014] Host processor 202 may comprise a core processing module and 
10 other support modules that interface with other system elements. For example, a 
support module may include a bus unit that communicates with a memory 
controller on system 200. Host processor 202 may comprise, for example, an 
Intel® Pentium® microprocessor that is commercially available from the Assignee 
of the subject application. Of course, alternatively, host processor 202 may 
15 comprise another type of microprocessor, such as, for example, a microprocessor 
that is manufactured and/or commercially available from a source other than the 
Assignee of the subject application, without departing from embodiments of the 
invention. 

[001 5] Host processor 202 may be communicatively coupled to chipset 
20 208. Chipset 208 may comprise a host bridge/hub system that may couple host 
processor 202 and host memory 204 to each other and to bus 206. Chipset 208 
may also include an I/O bridge/hub system (not shown) that may couple the host 
bridge/bus system to bus 206. Chipset 208 may comprise one or more integrated 
circuit chips, such as those selected from integrated circuit chipsets commercially 

4 
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available from the Assignee of the subject application (e.g., graphics memory and 
I/O controller hub chipsets), although other one or more integrated circuit chips 
may also, or alternatively, be used. 

[0016] Bus 206 may comprise a bus that complies with the Peripheral 
5 Component Interconnect (PCI) Local Bus Specification, Revision 2.2, December 
18, 1998 available from the PCI Special Interest Group, Portland, Oregon, U.S.A. 
(hereinafter referred to as a "PCI bus"). Alternatively, bus 106 instead may 
comprise a bus that complies with the PCI-X Specification Rev. 1.0a, July 24, 
2000, (hereinafter referred to as a "PCI-X bus"), or a bus that complies with the 
10 PCI-E Specification Rev. PCI-E (hereinafter referred to as a "PCI-E bus"), as 
specified in "The PCI Express Base Specification of the PCI Special Interest 
Group", Revision 1 .0a, both available from the aforesaid PCI Special Interest 
Group, Portland, Oregon, U.S.A. Also, alternatively, bus 106 may comprise other 
types and configurations of bus systems. 

15 [0017] System 200 may additionally comprise circuitry 216. Circuitry 216 
may comprise one or more circuits to perform one or more operations described 
herein as being performed by TCP-A (Transport Control Protocol - Accelerated) 
driver 222 and/or network component 212. Circuitry 216 may be hardwired to 
perform the one or more operations, and/or may execute machine-executable 

20 instructions to perform these operations. For example, circuitry 216 may comprise 
memory 236 that may store machine-executable instructions 226 that may be 
executed by circuitry 216 to perform these operations. Instead of being comprised 
in host processor 202, or chipset 208, some or all of circuitry 216 may be 
comprised in a circuit card (not shown), and/or other structures, systems, and/or 

5 
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devices that may be, for example, comprised in motherboard 218, and/or 
communicatively coupled to bus 206, and may exchange data and/or commands 
with one or more other components in system 200. Circuitry 216 may comprise, 
for example, one or more digital circuits, one or more analog circuits, one or more 
5 state machines, programmable circuitry, and/or one or more ASIC's (Application- 
Specific Integrated Circuits). 

[0018] System 200 may additionally comprise one or more memories to 
store machine-executable instructions 226 capable of being executed, and/or data 
capable of being accessed, operated upon, and/or manipulated by circuitry, such 

10 as circuitry 216. For example, these one or more memories may include host 
memory 204, or memory 236. One or more memories 204, 236 may, for example, 
comprise read only, mass storage, random access computer-readable memory, 
and/or one or more other types of machine-readable memory. The execution of 
program instructions 226 and/or the accessing, operation upon, and/or 

15 manipulation of data by circuitry 216 may result in, for example, circuitry 21 6 

carrying out some or all of the operations described herein as being carried out by 
various hardware and/or software components in system 200. 

[0019] For example, machine-executable instructions 226 may comprise a 
set of instructions for an application 218; a set of instructions for operating system 
20 220; a set of instructions for TCP-A driver 222; and/or a set of instructions for 
DMA driver 224. In one embodiment, circuitry 216 of host processor 202 may 
execute machine-executable instructions 226 for TCP-A driver 222, for DMA 
driver 224, and for operating system 220. Machine-executable instructions 226 
may execute in memory by circuitry 216, such as in host processor 202, and/or by 

6 
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circuitry 216 in general. 

[0020] A method according to one embodiment is illustrated in the flowchart 
of FIG. 3 with reference to system 200 of FIG. 2. The method begins at block 
300, and continues to block 302 where network component 212 may receive an 
5 indication that one or more packets 228 (only one shown) have been received 
from network 100. Each packet 228 may comprise a header 230 and a pay load 
232. For each packet 228, network component 212 may split header 230 and 
payload 232 from packet 228, and post each 230, 232 to one or more post buffers 
214A, 214B. In one embodiment, header 230 may be posted to a first buffer such 
10 as post buffer 214A, and payload 232 may be posted to a second buffer such as 
post buffer 21 4B. The one or more packets 228 may be comprised in one or more 
groups, and each group of packets 228 may be transmitted and/or received over a 
connection. The one or more packets 228 may be received in response to a read 
data request from, for example, application 218. 

15 [0021] "Application" refers to one or more programs that use the network. 
An application 218 may comprise, for example, a web browser, an email serving 
application, a file serving application, or a database application. In conjunction 
with a read data request, application 218 may designate destination read buffer 
214C where application 218 may access the requested data. In conjunction with 

20 a transmit data request, application 218 may write data to be transmitted to source 
buffer 21 4D. 

[0022] "Network component" refers to any combination of hardware and/or 
software on an I/O (input/output) subsystem that may process one or more 
packets sent and/or received over a network. In one embodiment, I/O subsystem 
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238 may comprise, for example, a NIC (network interface card), and network 
component 212 may comprise, for example, a MAC (media access control) layer 
of the Data Link Layer as defined in the Open System Interconnection (OSI) 
model for networking protocols. The OSI model is defined by the International 
5 Organization for Standardization (ISO) located at 1 rue de Varembe, Case postale 
56 CH-1 211 Geneva 20, Switzerland. 

[0023] A "connection" as used herein refers to a logical pathway to facilitate 
communications between a first node on a network and a second node on the 
network. A connection may facilitate communications for one or more 

10 transactions between the first node and the second node. A "transaction" refers to 
a request to send or receive data, such as may be initiated by an application, such 
as application 218, on a system, such as system 200. Each connection may be 
associated with a protocol context. As used herein, "protocol context" refers to 
information about a connection. For example, the information may comprise the 

1 5 sequence number of the last packet sent/received, and amount of memory 
available. 

[0024] At block 304, network component 212 may notify TCP-A driver that 
one or more packets 228 have arrived. In one embodiment, network component 
212 may notify TCP-A driver 222 by notifying operating system 220 in accordance 
20 with an interrupt moderation scheme. An interrupt moderation scheme refers to a 
condition where an interrupt may be asserted for every n packets received by 
network component 212. Thus, if network component 212 receives n or more 
packets, network component 212 may notify operating system 220 that packets 
have arrived. Likewise, if network component 212 receives less than n packets, 
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network component 212 may instead wait until more packets are received before 
notifying operating system 220. In one embodiment, operating system 220 may 
then notify TCP-A driver 222 that packets are ready to be processed. 

[0025] At block 306, TCP-A driver 222 may perform packet processing for 
5 at least one of the one or more packets. Packet processing may be performed by 
the TCP-A driver 222 retrieving header 230 from post buffer 21 4A, parsing the 
header 230 to determine the protocol context associated with the current 
connection, and performing TCP protocol compliance. TCP protocol compliance 
may comprise, for example, verifying the sequence number of a received packet 
10 to ensure that the packet is within a range of numbers that was agreed upon 
between the communicating nodes; verifying the payload size to ensure that the 
packet is within a range of sizes that was agreed upon between the 
communicating nodes; ensuring that the header structure conforms to the 
protocol; and ensuring that the timestamps are within an expected time range. 

15 [0026] TCP-A driver 222 may fetch a next header to process prior to 
completing the processing of a current header. This may ensure that the next 
header is available in the host processor's caches (not shown) before the TCP-A 
driver 222 is ready to perform TCP processing on it, thereby reducing host 
processor stalls. The method may continue to block 308. 

20 [0027] In one embodiment, TCP-A driver 222 may additionally determine if 

a connection associated with a packet is to be accelerated prior to performing 

packet processing. TCP-A driver 222 may accelerate select connections. Select 

connections may comprise, for example, connections that are long-lived, or which 

comprise large data. If TCP-A driver 222 determines that network connection is to 

9 



WO 2005/104486 



PCT/US2004/021015 



be accelerated, TCP-A driver 222 may perform packet processing as described at 
block 306. If TCP-A driver 222 determines that network connection is not to be 
accelerated, the method may continue to block 318. 

[0028] At block 308, TCP-A driver 222 may determine if one or more 
5 payloads 232 placed in post buffer 214B are ready for placement. A payload 232 
may be ready for placement if, for example, the corresponding header has been 
successfully processed, and a read buffer, such as read buffer 214C, has been 
designated. If at block 308, payload 232 is not ready for placement, the method 
may continue to block 31 0. In one embodiment, TCP-A driver 222 may determine 
10 if there are one or more payloads 232 ready for placement at anytime. For 
example, if it is determined that payload 232 is not ready for placement, TCP-A 
driver 222 may wait for some period of time before it makes this determination 
again. Where payload 232 cannot be placed because a read buffer 21 4C does 
not exist, for example, TCP-A driver 222 may alternatively or additionally at 
15 anytime indicate to operating system 220 the presence of payload 232 ready to be 
placed. Operating system 220 may then designate a buffer, or may ask 
application 218 to designate a buffer. If there are one or more payloads ready for 
placement, the method may continue to block 312. 

[0029] At block 31 0, TCP-A driver 222 may determine if there are more 
20 packets 228 to process, for example in post buffer 21 4A, of the n packets for the 
current interrupt. If there are more packets 228 to process, the method reverts to 
block 306. If there are no more packets 228, and one or more packets 228 have 
not been previously placed, and are ready for placement, the method may 
continue to block 312. If there are no more packets 228 to process, and there are 

10 
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no previous packets 228 to place, the method may continue to block 314. 

[0030] At block 312, TCP-A driver 222 may perform one or more operations 
that result in a data movement module placing one or more corresponding 
payloads 232 into a read buffer, such as read buffer 21 4C. As used herein, a 
5 "data movement module" refers to a module for moving data from a source to a 
destination without using the core processing module of a host processor, such as 
host processor 202. A data movement module may comprise, for example, a 
DMA engine as described below. 

[0031] In one embodiment, for example, TCP-A driver 222 may send a 
10 request to DMA driver 224, and DMA driver 224 may schedule a request with 
DMA engine 210 to write the one or more payloads 232 from post buffer 21 4B to 
read buffer 21 4C. In another embodiment, TCP-A driver 222 may directly 
program DMA engine 210 to write the one or more payloads 232 from post buffer 
21 4B to read buffer 21 4C. DMA driver 224 may be a standalone driver, or part of 
15 some other driver, such as TCP-A driver 222. Rather than being part of chipset 
208, DMA engine 210 may be a support module of host processor 202. By using 
the DMA engine for placement of data, host processor 202 may be freed from the 
overhead of performing data movements, which may result in the host processor 
202 running at much slower memory speeds compared to the core processing 
20 module speeds. Following the DMA engine 210 scheduling, the method may 
revert to block 310 to determine if there are additional packets 228 to process. 

[0032] At block 314, TCP-A driver 222 may determine if there are any 

pending DMA completions for the current interrupt. Alternatively, TCP-A driver 

222 may look for DMA completions at anytime. A "pending completion" as used 
n 
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herein refers to the completion of a request. In one embodiment, a pending 
completion refers to the completion of a request to DMA engine 210 to write one 
or more payloads 232. If, at block 314, there are one or more pending DMA 
completions for the current interrupt, the method may continue to block 316. If at 
5 block 314 there are no pending DMA completions for the current interrupt, the 
method may continue to block 318. 

[0033] At block 316, TCP-A driver 222 may perform other tasks. Other 
tasks may include looking for more packets in a subsequent interrupt, setting up 
the DMA engine 210 to issue an interrupt upon completion of a last queued task 
10 for the current interrupt, or other housekeeping, such as transmitting data, and 
performing TCP timer related tasks. 

[0034] At block 318, TCP-A driver 222 may pass control back to operating 
system 220. If one or more packets 228 have still not been processed, operating 
system 220 may notify a TCP driver (not shown) rather than TCP-A driver 222, 
15 where the TCP driver may perform TCP stack processing by performing packet 
processing, and by using the core processing module of host processor 202 to 
perform data transfers. If all packets 228 have been processed, operating system 
220 may wait for a next interrupt. 

[0035] The method may end at block 320. 

20 [0036] A method according to another embodiment is illustrated in FIG. 4. 
The method begins at block 400 and continues to block 402 where operating 
system 220 may receive a request from application 218 to transmit data 234 
placed in buffer 214D. Operating system 220 may perform preliminary checks on 

12 
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data 234. Preliminary checks may include, for example, obtaining the associated 
protocol context. In a TCP/IP connection, for example, protocol context may 
comprise packet sequence numbers to identify the order of the packets, buffer 
addresses of buffers used to store data, and timer/timestamp information for 
5 retransmissions. 

[0037] At block 404, operating system 220 may notify TCP-A driver 222 that 
there is data 234 to be transmitted from buffer 21 4D. 

[0038] At block 406, TCP-A driver 222 may perform one or more operations 
that result in data 234 being transmitted to network component 212. For example, 

10 these one or more operations may include TCP-A driver 222 programming DMA 
engine 210 to transmit data 234 from source buffer 214D to network component 
212. Alternatively, TCP-A driver 222 may queue a buffer, such as queued buffer 
21 4E, to network component 212, where network component 212 may instead 
read data 234 from queued buffer 21 4E. Source buffer 21 4D may be designated 

15 by application 21 8, for example, and queued buffer 214E may be designated by 
network component 212, for example. 

[0039] In one embodiment, TCP-A driver 222 may program DMA engine 
210 to transmit data if the data is small, and TCP-A driver 222 may queue a 
buffer, such as queued buffer 214E, if the data is large. As used herein, "queuing 
20 a buffer" means to notify a component that there is a buffer from which it can 

access data. For example, TCP acknowledgment packets to acknowledge receipt 
of packets may typically be relatively small-sized packets, and may be sent by 
TCP-A driver 222 to network component 212 by TCP-A driver 222 programming 
DMA engine 210 to transmit data 234. As another example, storage applications 

13 
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that send large files over the network may be relatively large, and may therefore 
be sent by TCP-A driver 222 to network component 212 by queuing buffer 21 4E. 

[0040] At block 408, in response to receiving the data, network component 
212 may create one or more packets for transmission by packetizing the data. In 

5 one embodiment, network component 212 may packetize data by performing 
segmentation on the data. "Segmentation" refers to breaking the data into smaller 
pieces for transmission. In one embodiment, network component 212 may 
comprise a MAC, and segmentation may be referred to as a large send offload, 
wherein MAC frames may be created for transmission of data 234 over the 

10 network. Network component 212 may receive data directly from TCP-A driver 
222, or by accessing queued buffer 21 4E. 

[0041] The method may end at block 410. Thereafter, operating system 
220 may send a completion notification to application 218. Furthermore, source 
buffer 21 4D may be returned to application 218, and application may use the 
1 5 buffer for other purposes. 

[0042] Embodiments of the present invention may be provided, for 
example, as a computer program product which may include one or more 
machine-readable media having stored thereon machine-executable instructions 
that, when executed by one or more machines such as a computer, network of 
20 computers, or other electronic devices, may result in the one or more machines 
carrying out operations in accordance with embodiments of the present invention. 
A machine-readable medium may include, but is not limited to, floppy diskettes, 
optical disks, CD-ROMs (Compact Disc-Read Only Memories), and magneto- 
optical disks, ROMs (Read Only Memories), RAMs (Random Access Memories), 

14 
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EPROMs (Erasable Programmable Read Only Memories), EEPROMs (Electrically 
Erasable Programmable Read Only Memories), magnetic or optical cards, flash 
memory, or other type of media / machine-readable medium suitable for storing 
machine-executable instructions. 

[0043] Moreover, embodiments of the present invention may also be 
downloaded as a computer program product, wherein the program may be 
transferred from a remote computer (e.g., a server) to a requesting computer 
(e.g., a client) by way of one or more data signals embodied in and/or modulated 
by a carrier wave or other propagation medium via a communication link (e.g., a 
modem and/or network connection). Accordingly, as used herein, a machine- 
readable medium may, but is not required to, comprise such a carrier wave. 

Conclusion 

[0044] Therefore, in one embodiment, a method may comprise receiving an 
indication on a network component that one or more packets have been received 
from a network; the network component notifying a TCP-A (transport control 
protocol - accelerated) driver that the one or more packets have arrived; a TCP-A 
driver performing packet processing for at least one of the one or more packets; 
and the TCP-A driver performing one or more operations that result in a data 
movement module placing one or more corresponding payloads of the at least 
one of the one or more packets into a read buffer. 

[0045] Embodiments of the invention may significantly reduce TCP/IP 
processing overhead that may result from using the core processing module of a 
host processor. TCP/IP processing may be accelerated by using a data 
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movement module, such as a DMA engine, to move data from one buffer to 
another buffer. Since the core processing module of a host processor may be 
bypassed using a DMA engine, slow memory access speeds may be avoided. 
Furthermore, TCP/IP processing performed on the host processor may scale 
better than TOE processing because the number of contexts is not limited by TOE 
memory. 

[0046] In the foregoing specification, the invention has been described with 
reference to specific embodiments thereof. It will, however, be evident that 
various modifications and changes may be made to these embodiments without 
departing therefrom. The specification and drawings are, accordingly, to be 
regarded in an illustrative rather than a restrictive sense. 
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WHAT IS CLAIMED IS: 

1 . A method comprising: 

receiving an indication on a network component that one or more packets 
have been received from a network; 

5 the network component notifying a TCP-A (transport control protocol - 

accelerated) driver that the one or more packets have arrived; 

a TCP-A driver performing packet processing for at least one of the one or 
more packets; and 

the TCP-A driver performing one or more operations that result in a data 
10 movement module placing one or more corresponding payloads of 

the at least one of the one or more packets into a read buffer. 

2. The method of claim 1 , additionally comprising, in response to receiving an 
indication on a network component that one or more packets have been 
received from the network, the network component: 

15 splitting each of the one or more packets into a header and a payload; and 

posting each of the header and payload to one or more post buffers. 

3. The method of claim 2, wherein the TCP-A driver performs packet 
processing by processing each of the headers, the method additionally 
comprising fetching a next header of the one or more headers prior to 

20 completing the processing of the current header. 

4. The method of claim 1 , wherein said performing one or more operations 
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that result in a data movement module placing one or more corresponding 
payloads of the at least one of the one or more packets into a read buffer 
comprises sending a request to a data movement module driver to write the 
one or more corresponding payloads to the read buffer. 

5. The method of claim 1 , wherein said TCP-A driver performing one or more 
operations that result in a data movement module placing one or more 
corresponding payloads of the at least one of the one or more packets into 
a read buffer comprises programming the data movement module to write 
the one or more corresponding payloads to the read buffer. 

6. The method of claim 1 , wherein the data movement module comprises a 
DMA (direct memory access) engine. 

7. The method of claim 6, wherein the DMA engine resides on a chipset. 

8. The method of claim 6, wherein the DMA engine resides on a host 
processor as a support module. 

9. The method of claim 1 , additionally comprising: 

receiving a request on an operating system to transmit data over the 
network; 

the operating system notifying the TCP-A driver that there is data to be 
transmitted; 

the TCP-A driver performing one or more operations that result in the data 
being transmitted to the network component; 
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in response to receiving the data, the network component creating one or 
more packets for transmission by packetizing the data; and 

the network component transmitting the one or more packets over the 
network. 

10. An apparatus comprising: 

a network component capable of: 

receiving an indication on a network component that one or more 
packets have been received from a network; and 

notifying a TCP-A (transport control protocol - accelerated) driver 
that the one or more packets have arrived; and 

a TCP-A driver capable of: 

performing packet processing for at least one of the one or more 
packets; and 

the TCP-A driver performing one or more operations that result in a 
data movement module placing one or more corresponding 
payloads of the at least one of the one or more packets into a 
read buffer. 

1 1 . The apparatus of claim 1 0, additionally comprising an operating system 
capable of: 

receiving a request to transmit data over the network; and 
notifying the TCP-A driver that data is ready to be transmitted; 
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wherein: 

the TCP-A driver is capable of performing one or more operations 
that result in the data being transmitted to the network 
component; and 

5 the network component is capable of: 

creating one or more packets for transmission by packetizing 
the data in response to receiving the data; and 

transmitting the one or more packets over the network. 

12. The apparatus of claim 1 0, wherein in response to receiving an indication 
10 on a network component that one or more packets have been received 

from the network, the network component is additionally capable of: 

splitting each of the one or more packets into a header and a payload; and 
posting each of the header and payload to one or more post buffers. 

1 3. The apparatus of claim 1 2, wherein the TCP-A driver performs packet 
15 processing by processing each of the headers, and the TCP-A driver is 

additionally capable of fetching a next header of the one or more headers 
prior to completing the processing of the current header. 

14. A system comprising: 

a chipset having a DMA (direct memory access) engine, the chipset 
20 communicatively coupled to a TCP-A (Transport Control Protocol - 

Accelerated) driver of a processor and to a network component; 
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the network component capable of: 

receiving an indication that one or more packets have been received 
from a network; and 

notifying the TCP-A (transport control protocol - accelerated) driver 
5 that the one or more packets have arrived; and 

the TCP-A driver of the processor capable of: 

performing packet processing for at least one of the one or more 
packets; and 

performing one or more operations that result in a data movement 
10 module placing one or more corresponding payloads of the at 

least one of the one or more packets into a read buffer. 

1 5. The system of claim 14, additionally comprising an operating system of the 
processor capable of: 

receiving a request to transmit data over the network; and 
15 notifying the TCP-A driver that data is ready to be transmitted; 

wherein: 

the TCP-A driver is capable of performing one or more operations 
that result in the data being transmitted to a network 
component; and 

20 the network component is capable of: 
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creating one or more packets for transmission by packetizing 
the data in response to receiving the data; and 

transmitting the one or more packets over the network. 

16. The system of claim 14, wherein in response to receiving an indication on a 
5 network component that one or more packets have been received from the 

network, the network component is additionally capable of: 

splitting each of the one or more packets into a header and a payload; and 
posting each of the header and payload to one or more post buffers. 

1 7. The system of claim 1 6, wherein the TCP-A driver performs packet 

10 processing by processing each of the headers, and the TCP-A driver is 

additionally capable of fetching a next header of the one or more headers 
prior to completing the processing of the current header. 

1 8. A machine-readable medium having stored thereon instructions, the 
instructions when executed by a machine, result in the following: 

15 receiving an indication on a network component that one or more packets 

have been received from a network; 

the network component notifying a TCP-A (transport control protocol - 
accelerated) driver that the one or more packets have arrived; 

a TCP-A driver performing packet processing for at least one of the one or 
20 more packets; and 

the TCP-A driver performing one or more operations that result in a data 
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movement module placing one or more corresponding payloads of 
the at least one of the one or more packets into a read buffer. 

19. The machine-readable medium of claim 18, wherein in response to 
receiving an indication on a network component that one or more packets 
have been received from the network, the instructions additionally result in: 

splitting each of the one or more packets into a header and a payload; and 

posting each of the header and payload to one or more post buffers. 

20. The machine-readable medium of claim 19, wherein the TCP-A driver 
performs packet processing by processing each of the headers, the 
instructions additionally result in fetching a next header of the one or more 
headers prior to completing the processing of the current header. 

21 . The machine-readable medium of claim 18, wherein the instructions that 
result in performing one or more operations that result in a data movement 
module placing one or more corresponding payloads of the at least one of 
the one or more packets into a read buffer additionally result in sending a 
request to a data movement module driver to write the one or more 
corresponding payloads to the read buffer. 

22. The machine-readable medium of claim 18, wherein the instructions that 
result in said TCP-A driver performing one or more operations that result in 
a data movement module placing one or more corresponding payloads of 
the at least one of the one or more packets into a read buffer additionally 
result in programming the data movement module to write the one or more 
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corresponding payloads to the read buffer. 

23. The machine-readable medium of claim 18, wherein the data movement 
module comprises a DMA (direct memory access) engine. 

24. The machine-readable medium of claim 24, wherein the DMA engine 
5 resides on a chipset. 

25. The machine-readable medium of claim 24, wherein the DMA engine 
resides on a host processor as a support module. 

26. The machine-readable medium of claim 18, the instructions additionally 
result in: 

10 receiving a request on an operating system to transmit data over the 

network; 

the operating system notifying the TCP-A driver that there is data to be 
transmitted; 

the TCP-A driver performing one or more operations that result in the data 
15 being transmitted to the network component; 

in response to receiving the data, the network component creating one or 
more packets for transmission by packetizing the data; and 

the network component transmitting the one or more packets over the 
network. 
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